Le A 36 493-Foreign Countries Sp/AB/NT 1U/ D3f 014

JC09Rec'dPCT/PTO Q3JUN 2005 




ISOLATED FLUORESCENT PROTEIN FROM CLYTIA GREGARIA 
(CGFP) AND USE THEREOF 



The invention relates to the nucleotide and amino acid sequences and to the activity 
and use of the fluorescent protein CGFP (fluorescence protein of Clytia gregaria). 

Fluorescent proteins 

A multiplicity of coelenterates are bioluminescent (Morin et al., 1974) and emit blue
or green light. Aequorin from Aequorea victoria, which was identified as the first
light-producing protein in 1962 (Shimomura et al., 1962), emitted, as an isolated
protein, blue light rather than the phenotypically observed green light of Aequorea
victoria. Later, the green fluorescent protein (GFP) was isolated from Aequorea
victoria; owing to its excitation by aequorin, it makes the medusa appear green
phenotypically (Johnson et al., 1962; Hastings et al., 1969; Inouye et al., 1994).

Green fluorescent proteins have been isolated from different organisms. These
include the Hydrozoa (Aequorea, Halistaura, Obelia) and Anthozoa (Acanthoptilum, sea
cactus, Cavernularia, Renilla, Ptilosarcus, Stylatula) (Morin et al., 1971; Morin et al.,
1971 n; Wampler et al., 1971; Wampler et al., 1973; Cormier et al., 1973; Cormier et
al., 1974; Levine et al., 1982).



A compilation of some fluorescent proteins can be found in Table 1:

Table 1:

Overview of some fluorescent proteins. Indicated are the name, the organism from
which the protein has been isolated and the identification number (Acc. No.) of the
database entry.

Name                                     Organism                 Identification no.
Green fluorescent protein                Aequorea macrodactyla    AF435433
Green fluorescent protein                Aequorea                 L29345
Green fluorescent protein-like protein   [illegible]              [illegible]
Green fluorescent protein-like protein   Agaricia fragilis        AY037765
Green fluorescent protein                Dendronephthya           AF420591
Red fluorescent protein                  Entacmaea quadricolor    AY130757
Green fluorescent protein-like protein   Caribbean anemone        AY037777
Green fluorescent protein                Heteractis crispa        AF420592
Green fluorescent protein-like protein   Montastraea annularis    AY037766
Green fluorescent protein-like protein   Montastraea cavernosa    AY037768
Cyan fluorescent protein                 Montastraea cavernosa    AY056460
Green fluorescent protein                Renilla muelleri         AY015996
Green fluorescent protein                Renilla reniformis       AF372525
Green fluorescent protein-like protein   Ricordea florida         AY037774



The fluorescent proteins differ not only due to their nucleotide and amino acid 
sequences but also due to their biochemical and physical properties. The spectral 
characteristics of the fluorescent proteins may differ both on the side of excitation 
and on the side of emission. An overview of the fluorescence spectra and the
excitation wavelength can be found in Table 2.

Table 2:

Overview of some fluorescent proteins. Indicated are the organism from which the
protein has been isolated, the excitation wavelength and emission wavelength
determined in spectral analyses.

Organism     Excitation    Fluorescence
Aequorea     465-498 nm    509 nm
Halistaura   465 nm        497 nm
Phialidium   485 nm        498 nm
Renilla      498 nm        508 nm
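
As a small worked illustration of how the excitation and emission sides differ, the values in Table 2 can be used to estimate the difference between the emission wavelength and the excitation wavelength (the Stokes shift) for each organism. The following Python sketch is not part of the original disclosure; the dictionary layout and the function name are illustrative assumptions, and for the excitation range given for Aequorea the difference is reported for both ends of the range.

```python
# Excitation and emission wavelengths (nm) copied from Table 2.
# Excitation is stored as a (low, high) range; a single value uses low == high.
TABLE_2 = {
    "Aequorea": ((465, 498), 509),
    "Halistaura": ((465, 465), 497),
    "Phialidium": ((485, 485), 498),
    "Renilla": ((498, 498), 508),
}


def stokes_shift_range(excitation, emission):
    """Return (smallest, largest) emission-minus-excitation difference in nm."""
    low, high = excitation
    return emission - high, emission - low


if __name__ == "__main__":
    for organism, (excitation, emission) in TABLE_2.items():
        smallest, largest = stokes_shift_range(excitation, emission)
        print(f"{organism}: {smallest}-{largest} nm")
```

Storing the excitation as a range keeps the Aequorea entry faithful to the table instead of picking one wavelength arbitrarily.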



The use of fluorescent proteins has already been described previously. An overview 
can be found in Table 3: 

Table 3:

Overview of some fluorescent proteins. Indicated are the organism from which the
protein has been isolated, the name of the fluorescent protein and a selection of
patents or applications.

Organism          Fluorescent protein   Patent / application
Renilla mulleri   GFP                   US patent no. 6,436,682
                                        WO200168824
                                        WO200257451
                                        WO200134824
                                        WO9949019
                                        US patent no. 6,232,107
Aequorea          GFP                   WO200071565
                                        WO9711094
                                        WO9623898
                                        US patent no. 5,958,713
                                        US patent no. 6,172,188



It was shown that it is possible to alter the physical and biochemical properties of 
fluorescent proteins by altering the amino acid sequence thereof. Examples of 
mutagenized fluorescent proteins have been described in the literature (Delagrave et 
al., 1995; Ehrig et al., 1995; Heim et al., 1996).

Fluorescent proteins are already used in a wide variety of areas. The use of
fluorescent proteins in 'Fluorescence Resonance Energy Transfer' (FRET),
'Bioluminescence Resonance Energy Transfer' (BRET) and other energy transfer
methods has already been described in the literature (Mitra et al., 1996; Ward et al.,
1978; Cardullo et al., 1988; US patent no. 4,777,128; US patent no. 5,126,508; US
patent no. 4,927,923; US patent no. 5,279,943). Further nonradioactive methods of
energy transfer by means of GFP have likewise been described previously (PCT appl.
WO 98/02571 and WO 97/28261).

Reporter systems 






Reporter gene or indicator gene generally refers to genes whose gene products can be
readily detected with the aid of simple biochemical or histochemical methods. At
least 2 types of reporter genes are distinguished.

1. Resistance genes. Resistance genes refer to genes whose expression imparts
to a cell resistance to antibiotics or other substances whose presence in the
growth medium leads to cell death when the resistance gene is absent.

2. Reporter genes. The products of reporter genes are used as fused or nonfused
indicators in genetic engineering. The most commonly used reporter genes
include beta-galactosidase (Alam et al., 1990), alkaline phosphatase (Yang et
al., 1997; Cullen et al., 1992), luciferases and other photoproteins
(Shimomura, 1985; Phillips GN, 1997; Snowdowne et al., 1984).

Luminescence refers to the emission of photons in the visible spectral range, said
emission being effected by excited emitter molecules. In contrast to fluorescence, the
energy is not supplied here from the outside in the form of radiation of shorter
wavelength.

A distinction is made between chemiluminescence and bioluminescence. 
Chemiluminescence refers to a chemical reaction resulting in an excited molecule 
which luminesces itself when the excited electrons return to the ground state. If this 
reaction is catalysed by an enzyme, then this process is called bioluminescence. The 
enzymes involved in the reaction are generally referred to as luciferases. 

Classification of the species Clytia gregaria 

Cnidaria → Leptomedusae → Campanulariidae → Clytia gregaria

The species Clytia gregaria belongs to the Cnidaria, in particular to the medusae. The
bioluminescent or fluorescent phenotype was already described in 1998 (Ward et
al., 1998).

Isolation of cDNA

In order to study the fluorescent activity of the species Clytia gregaria, specimens
were caught in Friday Harbor in Washington State (USA) and stored in liquid
nitrogen. The cDNA library was prepared using exclusively the bioluminescent ring
of one medusa specimen. The Clytia gregaria cDNA libraries were generated by
isolating the RNA by means of isothiocyanate according to the method of Krieg
(Krieg et al., 1996).

The cDNA was prepared by carrying out an RT-PCR. To this end, 10 µg of RNA
were incubated with reverse transcriptase (Superscript Gold II) according to the
following plan:

PCR

Step   Duration      Temperature
1.     30 seconds    95°C
2.     6 minutes     68°C
3.     10 seconds    95°C
4.     6 minutes     68°C

17 cycles of step 4 after step 3
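
The temperature plan above can be written down as data to estimate the total hold time of the program. The following Python sketch is an illustration only and not part of the original protocol: it assumes that steps 1 and 2 run once and that steps 3 and 4 are then repeated together 17 times, which is one reading of "17 cycles of step 4 after step 3". Ramp times between temperatures are ignored, and the names `STEPS`, `CYCLES` and `total_seconds` are hypothetical.

```python
# Temperature plan from the text: step -> (duration in seconds, temperature in °C).
STEPS = {
    1: (30, 95),
    2: (6 * 60, 68),
    3: (10, 95),
    4: (6 * 60, 68),
}
CYCLES = 17  # assumption: steps 3 and 4 are repeated together 17 times


def total_seconds(steps, cycles):
    """Steps 1 and 2 run once; steps 3 and 4 are repeated `cycles` times."""
    initial = steps[1][0] + steps[2][0]
    per_cycle = steps[3][0] + steps[4][0]
    return initial + cycles * per_cycle


if __name__ == "__main__":
    seconds = total_seconds(STEPS, CYCLES)
    print(f"{seconds} s = {seconds / 60:.1f} min")
```

Under these assumptions the hold time is 30 + 360 + 17 × (10 + 360) seconds, that is, a little under two hours.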

In order to inactivate the polymerase, the reaction products were incubated at 37°C
with proteinase K for 30 minutes and the cDNA was precipitated with ethanol. The
cDNA was dissolved in water and incubated with SfiI at 37°C for one hour. The
reaction products were gel-filtered in order to remove small fragments. The
fractionated cDNA was then ligated into the SfiI-cut and dephosphorylated
λTriplEx2 vector. A λ-phage expression library was then prepared by packaging the
cloned cDNA fragments into λ phages by means of the in vitro packaging system of the
SMART cDNA Library Construction Kit (Clontech).

The recombinant phages containing a cDNA insert with potential expression of
fluorescent proteins were identified by carrying out a library screening.

To this end, bacterial lawns of transformed E. coli XL1-Blue were plated on 90 mm
culture dishes and incubated at 31°C for 12-15 hours. Induction of protein expression
was started by adding to the plates 60 µl of a 20 mM IPTG (isopropylthiogalactoside)
solution. After 24 hours of incubation at room temperature, the plates were stored at
4°C for 72 hours. This was followed by measuring the fluorescence.

To this end, the bacteria were irradiated using an argon laser (LGN502) with 100 mV
at 488 nm or 366 nm (UVL21). The fluorescence was measured using a 510 nm ZSV 
filter. 

To isolate the clones and for spectral analysis, the biomass of fluorescence-positive 
clones was removed from the culture plates and resuspended in PBS (phosphate-buffered
saline). The cells were disrupted by means of ultrasound. After clarifying
the lysate by centrifugation, the fluorescence of the supernatant was determined in a 
fluorimeter. 

A fluorescent protein was identified. The fluorescent protein was referred to as CGFP
(fluorescence protein of Clytia gregaria). The fluorescent protein CGFP is described
in detail below.

CGFP 


The fluorescent protein CGFP shows the highest homology at the amino acid level to
Aequorea GFP with an identity of 44% (depicted in Example 8; Figure 5). At the
nucleic acid level, the identity is below 30% (depicted in Example 9; Figure 6). The
BLAST method was used for sequence comparison (Altschul et al., 1997).
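
To make the notion of "identity" concrete, the following Python sketch computes the percentage of identical positions in a pairwise alignment. It is a minimal illustration and not the BLAST method itself: it assumes the two sequences have already been aligned (gaps written as "-"), and the function name `percent_identity` and the toy sequences are hypothetical and not taken from CGFP or GFP.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue.

    Columns containing a gap count toward the alignment length but never as
    matches, in the manner of a "matches/alignment length" identity figure.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for a, b in zip(aligned_a, aligned_b) if a == b and a != gap
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Toy alignment for illustration only: 8 columns, 5 of them identical.
    print(percent_identity("MTALTEGA", "MSA-TEGK"))
```

Dividing by the full alignment length, gaps included, is a design choice; other conventions divide by the length of the shorter sequence and give slightly different figures.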



The invention also relates to functional equivalents of CGFP. Functional equivalents
are those proteins which have comparable physicochemical properties and which are 
at least 70% homologous. Preference is given to a homology of 80% or 90%. 
Particular preference is given to a homology of 95%. 

The fluorescent protein CGFP is suitable as a reporter gene for cellular systems, 
especially for receptors, for ion channels, for transporters, for transcription factors or 
for inducible systems. 

The fluorescent protein CGFP is suitable as a reporter gene in bacterial and 
eukaryotic systems, especially in mammalian cells, in bacteria, in yeasts, in baculovirus systems, in
plants. 

The fluorescent protein CGFP is suitable as a reporter gene for cellular systems in 
combination with bioluminescent or chemiluminescent systems, especially systems 
with luciferases, with oxygenases, with phosphatases. 

The fluorescent protein CGFP is suitable as a marker protein, especially in FACS 
(fluorescence activated cell sorter) sorting. 

The fluorescent protein CGFP is suitable as a fusion protein, especially for receptors,
for ion channels, for transporters, for transcription factors, for proteinases, for 
kinases, for phosphodiesterases, for hydrolases, for peptidases, for transferases, for 
membrane proteins, for glycoproteins. 

The fluorescent protein CGFP is suitable for immobilization, especially by 
antibodies, by biotin, by magnetic or magnetizable carriers. 

The fluorescent protein CGFP is suitable as a protein for energy transfer systems, 
especially the FRET (Fluorescence Resonance Energy Transfer), BRET
(Bioluminescence Resonance Energy Transfer), FET (field effect transistors), FP
(fluorescence polarization), HTRF (Homogeneous time-resolved fluorescence) 
systems. 

The fluorescent protein CGFP is suitable as a label of substrates or ligands, especially 
for proteases, for kinases, for transferases. 

The fluorescent protein CGFP is suitable for expression in bacterial systems, 
especially for titer determination, as substrates for biochemical systems, especially 
for proteinases and kinases. 

The fluorescent protein CGFP is suitable as a marker, especially coupled to 
antibodies, coupled to enzymes, coupled to receptors, coupled to ion channels and 
other proteins. 

The fluorescent protein CGFP is suitable as a reporter gene in pharmacological drug 
screening, especially in HTS (High Throughput Screening). 

The fluorescent protein CGFP is suitable as a component of detection systems,
especially for ELISA (enzyme-linked immunosorbent assay), for 
immunohistochemistry, for Western blotting, for confocal microscopy. 

The fluorescent protein CGFP is suitable as a marker for the analysis of interactions, 
especially for protein-protein interactions, for DNA-protein interactions, for DNA-RNA
interactions, for RNA-RNA interactions, for RNA-protein interactions (DNA:
deoxyribonucleic acid; RNA: ribonucleic acid).

The fluorescent protein CGFP is suitable as a marker or fusion protein for expression 
in transgenic organisms, especially in mice, in rats, in hamsters and other mammals, 
in primates, in fish, in worms, in plants.

The fluorescent protein CGFP is suitable as a marker or fusion protein for analysing
embryonic development.

The fluorescent protein CGFP is suitable as a marker via a coupling mediator, 
especially via biotin, via NHS (N-hydroxysulphosuccinimides), via CNBr.

The fluorescent protein CGFP is suitable as a reporter coupled to nucleic acids, 
especially to DNA, to RNA. 

The fluorescent protein CGFP is suitable as a reporter coupled to proteins or 
peptides. 

The fluorescent protein CGFP coupled to nucleic acids or peptides is suitable as a 
probe, especially for Northern blots, for Southern blots, for Western blots, for 
ELISA, for nucleic acid sequencing, for protein analyses, for chip analyses.

The fluorescent protein CGFP is suitable as a label of pharmacological formulations, 
especially of infectious agents, of antibodies, of small molecules. 

The fluorescent protein CGFP is suitable for geological studies, especially for sea 
currents, groundwater currents and river currents. 

The fluorescent protein CGFP is suitable for expression in expression systems, 
especially in in-vitro translation systems, in bacterial systems, in yeast systems, in 
baculovirus systems, in viral systems, in eukaryotic systems.

The fluorescent protein CGFP is suitable for visualizing tissues or cells in surgical 
procedures, especially in invasive, in noninvasive, in minimally invasive procedures.

The fluorescent protein CGFP is also suitable for labeling tumour tissues and other
phenotypically altered tissues, especially in histological examination, in surgical 
interventions. 

The invention also relates to purifying the fluorescent protein CGFP, especially as
wild-type protein, as fusion protein, as mutagenized protein. 

The invention also relates to the use of the fluorescent protein CGFP in the field of 
cosmetics, especially of bath additives, of lotions, of soaps, of body paints, of 
toothpaste, of body powders.

The invention also relates to the use of the fluorescent protein CGFP for colouring, 
especially of food, of bath additives, of ink, of textiles, of plastics. 

The invention also relates to the use of the fluorescent protein CGFP for the
colouring of paper, especially of greeting cards, of paper products, of wallpapers, of 
handicraft articles. 

The invention also relates to the use of the fluorescent protein CGFP for the 
colouring of liquids, especially for water pistols, for fountains, for beverages, for ice
cream. 

The invention also relates to the use of the fluorescent protein CGFP for producing 
toys, especially finger paints, face paints. 


The invention relates furthermore to nucleic acid molecules, selected from the group 
consisting of 

a) nucleic acid molecules encoding the polypeptide disclosed by SEQ ID NO: 2; 

b) nucleic acid molecules containing the sequence depicted by SEQ ID NO: 1;

c) nucleic acid molecules whose complementary strand hybridizes under 
stringent conditions with a nucleic acid molecule of a) or b) and which have 
the biological function of a fluorescent protein;

d) nucleic acid molecules which differ from those mentioned under c) due to the 
degeneracy of the genetic code; 

e) nucleic acid molecules whose sequences are at least 95% homologous to 
SEQ ID NO: 1 and which have the biological function of a fluorescent 
protein; and 

f) nucleic acid molecules whose sequences are at least 65% homologous to 
SEQ ID NO: 1 and which have the biological function of a fluorescent 
protein. 

The invention relates to the abovementioned nucleic acid molecules whose sequence 
contains a functional promoter 5' of the sequence.

The invention also relates to nucleic acid molecules as described above which are a 
part of recombinant DNA or RNA vectors. 

The invention relates to organisms which contain such a vector. 

The invention refers to oligonucleotides, having more than 10 contiguous nucleotides 
which are identical or complementary to the DNA or RNA sequence of the CGFP 
molecules.

The invention relates to fluorescent proteins which are encoded by the
above-described nucleotide sequences.

The invention refers to methods of expressing the fluorescent polypeptides according 
to the invention in bacteria, eukaryotic cells or in in vitro expression systems. 

The invention also relates to methods of purifying/isolating a fluorescent polypeptide 
according to the invention. 

The invention relates to peptides, having more than 5 contiguous amino acids which 
are recognized immunologically by antibodies to the fluorescent proteins according 
to the invention. 

The invention relates to the use of the fluorescent proteins according to the invention 
as marker genes and reporter genes, in particular for pharmacological drug screening 
and diagnostics. 

Expression of the fluorescent proteins according to the invention 

Expression refers to the production of a molecule which, after introduction of the 
gene into a suitable host cell, allows the foreign gene cloned into an expression 
vector to be transcribed and translated. Expression vectors contain the control signals 
required for expression of genes in cells of prokaryotes or eukaryotes. 

Expression vectors may be constructed in principle in two different ways. In the case 
of "transcription fusions", the protein encoded by the cloned-in foreign gene is
synthesized as an authentic, biologically active protein. To this end, the expression
vector carries all 5' and 3' control signals required for expression.




In the case of "translation fusions", the protein encoded by the cloned-in foreign gene
is expressed as a hybrid protein together with another protein which can be readily
detected. The 5' and 3' control signals required for expression, including the start
codon and possibly part of the sequences coding for the N-terminal regions of the
hybrid protein to be formed, are derived from the vector. The additional protein part
introduced not only stabilizes, in many cases, the protein encoded by the cloned-in
foreign gene against degradation by cellular proteases but can also be used for
detecting and isolating the hybrid protein formed. Expression may be either transient
or stable. Suitable host organisms are bacteria, yeasts, viruses and eukaryotic
systems.

Purification of the fluorescent proteins according to the invention 

The isolation of proteins (also after overexpression) is frequently referred to as 
protein purification. A multiplicity of established methods and processes are
available for protein purification. 

Solid-liquid separation is a basic operation in protein isolation procedures. This 
process step is required both in the removal of cells from the culture medium and in 
clarifying the crude extract, after cell disruption and removal of the cell debris, in
removing precipitates after precipitations, etc. It is carried out by way of 
centrifugation and filtration. 

Obtaining intracellular proteins requires the cell wall to be destroyed or made
permeable. Depending on the scale and organism, high-pressure homogenizers or
stirred ball mills or glass bead mills are used for this purpose. On the laboratory
scale, mechanical cell disintegration and ultrasound treatment are used inter alia.

Various precipitation methods involving salts (in particular ammonium sulphate) or
organic solvents (alcohols, acetone) are a rapid and efficient method of concentrating
proteins, both for extracellular and intracellular proteins (after cell disruption). When
purifying intracellular proteins, removal of the soluble nucleic acids is desirable 
(precipitation with streptomycin or protamine sulphate, for example). The recovery 
of extracellular proteins frequently involves the addition of carriers (e.g. starch, 
kieselguhr), prior to addition of the precipitants, in order to obtain better manageable 
precipitates. 

Numerous chromatographic and partition methods are available for fine purification 
(adsorption chromatography and ion exchange chromatography, gel filtration, affinity
chromatography, electrophoreses). Column chromatography is also applied on the 
industrial scale. Affinity chromatography which makes purification factors of up to 
several 100 per step possible is especially important on the laboratory scale. 

Extracellular proteins are obtained in relatively dilute solutions. Like intracellular
proteins, they must be concentrated prior to further use. Ultrafiltration is a proven
method, also on the industrial scale, in addition to the methods already mentioned. 

Inorganic salts are frequently undesired accompanying substances of proteins in 
specific applications. They may be removed, inter alia, by gel filtration, dialysis and 
diafiltration. 

Numerous proteins are used as dry preparations. Important drying processes are 
vacuum drying, freeze drying and spray drying. 

Nucleotide and amino acid sequences 

The fluorescent protein CGFP is encoded by the following nucleotide sequence 
(SEQ ID NO: 1): 

atgactgcacttaccgaaggagcaaaactgttcgagaaagaaattccctacattaca 
gagttggaaggagacgttgaaggaatgaaattcatcatcaaaggtgaaggtactggc
gacgctactactggcaccatcaaagcgaaatatatttgcacaactggtgaccttcct
gtaccatgggctaccatcttgagtagtttgtcgtatggtgttttctgtttcgctaag 
tatccacgccacattgccgactttttcaagagcacacaaccagatggttattcacaa 
gacagaatcattagttttgacaatgatggacaatacgatgtcaaagccaaggttact 
tatgaaaacggaacactttataatagagtcacagtcaaaggtactggcttcaaatca
aacggcaacatccttggtatgagagttctctaccattcaccaccacacgctgtctac 
atccttcctgaccgtaaaaatggtggcatgaaaattgaatacaataaggctttcgac 
gttatgggcggtggtcaccaaatggcgcgtcacgcccaattcaataaaccactagga 
gcctgggaagaagattatccgttgtatcatcatcttaccgtatggacttctttcgga
aaagatccggatgatgatgaaactgaccatttgaccatcgtcgaagtcatcaaagct
gttgatttggaaacataccgttga-3'.

This results in an amino acid sequence of (SEQ ID NO: 2): 

MTALTEGAKLFEKEIPYITELEGDVEGMKFIIKGEGTGDATTGTIKAKYICTTGDLP 
VPWATILSSLSYGVFCFAKYPRHIADFFKSTQPDGYSQDRIISFDNDGQYDVKAKVT
YENGTLYNRVTVKGTGFKSNGNILGMRVLYHSPPHAVYILPDRKNGGMKIEYNKAFD 
VMGGGHQMARHAQFNKPLGAWEEDYPLYHHLTVWTSFGKDPDDDETDHLTIVEVIKA 
VDLETYR 

These sequences can be found in the sequence listing.
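
The relationship between SEQ ID NO: 1 and SEQ ID NO: 2 can be checked by translating the nucleotide sequence with the standard genetic code. The following Python sketch is an illustration only; the helper name `translate` is hypothetical, and for brevity the usage example translates just the first and the last line of the nucleotide sequence as printed above rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = "atgactgcacttaccgaaggagcaaaactgttcgagaaagaaattccctacattaca"
    last_line = "gttgatttggaaacataccgttga"
    print(translate(first_line))  # compare with the start of SEQ ID NO: 2
    print(translate(last_line))   # compare with the end of SEQ ID NO: 2
```

Each printed line of the nucleotide sequence holds 57 bases, that is, 19 complete codons, so individual lines can be translated separately without shifting the reading frame.
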
Description of the figures 

Fig. 1 depicts the plasmid map of the pTriplEX2-CGFP vector. 


Fig. 2 depicts the plasmid map of the pcDNA3-CGFP vector. 


Fig. 3 depicts the transient expression of CGFP in CHO cells in the pcDNA3-CGFP 
expression vector. The figure depicts the microscopic image of the transiently 
transfected cells, at an excitation of 480 nm and an emission of 520 nm.

Fig. 4 depicts the excitation of CGFP and of the control lysate.

Fig. 5 depicts the emission of CGFP and of the control lysate.

Fig. 6 depicts the alignment of CGFP, GFP (Aequorea) and GFP (Renilla) at the amino
acid level.

CGFP_Cly: CGFP from Clytia gregaria
GFP_Ren: GFP from Renilla
GFP_Aeq: GFP from Aequorea

Fig. 7 depicts the alignment of CGFP, GFP (Aequorea) and GFP (Renilla) at the
nucleic acid level.

CGFP_Cly: CGFP from Clytia gregaria
GFP_Ren: GFP from Renilla
GFP_Aeq: GFP from Aequorea

Examples

Example 1

The vector used for preparing the construct illustrated below was the plasmid 
pTriplEx2 from Clontech. The derivative of said vector was referred to as pTriplEx2- 
CGFP. The pTriplEx2-CGFP vector was used for expressing CGFP in bacterial 
systems. 

Fig. 1 depicts the plasmid map of the pTriplEX2-CGFP vector.

Example 2

The vector used for preparing the construct illustrated below was the plasmid 
pcDNA3.1(+) from Clontech. The derivative of said vector was referred to as 
pcDNA3-CGFP. The pcDNA3-CGFP vector was used for expressing CGFP in
eukaryotic systems. 

Fig. 2 depicts the plasmid map of the pcDNA3-CGFP vector.

Example 3

Bacterial expression

Bacterial expression was carried out in the E. coli strain BL21(DE3) by transforming 
the bacteria with the expression plasmids pTriplEX2-CGFP and pTriplEX2. The 
transformed bacteria were incubated in LB medium at 37°C for 3 hours and 
expression was induced for 4 hours by adding IPTG up to a final concentration of 
1 mM. The induced bacteria were harvested by centrifugation, resuspended in PBS 
and sonicated. The fluorescence was determined with the aid of a fluorimeter.

Example 4

Eukaryotic expression 

Constitutive eukaryotic expression was carried out in CHO cells by transfecting said 
cells with the expression plasmids pcDNA3-CGFP and pcDNA3.1(+) in transient 
experiments. To this end, 10000 cells per well were plated in DMEM-F12 medium 
on 96-well microtiter plates and incubated at 37°C overnight. Transfection was 
carried out with the aid of the Fugene 6 kit (Roche) according to the manufacturer's 
information. The transfected cells were incubated in DMEM-F12 medium at 37°C 
overnight. The fluorescence was measured in a fluorimeter at room temperature. 

Figure 3 depicts expression of CGFP in CHO cells. 

Example 5 

Spectrum of the fluorescent protein CGFP 

To measure the emission spectrum, E. coli BL21(DE3) were transformed with the 
plasmids pTriplEX2-CGFP and pTriplEX2. Induction was carried out by adding 
1 mM IPTG and incubating at 37 °C for 4 hours. The bacteria were subsequently 
harvested and resuspended in PBS. The lysis was carried out using ultrasound. 
Subsequently, fluorescence was measured in a fluorimeter. 

Figure 4 depicts the excitation of CGFP and of the control lysate. 

Figure 5 depicts the emission of CGFP and of the control lysate.

Example 6



BLAST 



Result of a BLAST analysis of CGFP at the amino acid level.

>AA2002: ABB06186 Abb06186 Green fluorescent protein GFPxm19 SEQ ID, NO: 15.
5/2002, Length = 271, Score = 219 bits (558), Expect = 3e-56, Identities =
102/228 (44%), Positives = 151/228 (65%), Gaps = 3/228 (1%)

>gb|AAK02065.1| mutant green fluorescent protein [synthetic construct],
Length = 238, Score = 219 bits (557), Expect = 4e-56, Identities = 102/227
(44%), Positives = 150/227 (65%), Gaps = 3/227 (1%)

>gb|AAL33915.1|AF435430_1 green fluorescent protein [Aequorea macrodactyla],
Length = 238, Score = 218 bits (556), Expect = 5e-56, Identities = 102/227
(44%), Positives = 150/227 (65%), Gaps = 3/227 (1%)

>gb|AAL33918.1|AF435433_1 green fluorescent protein [Aequorea macrodactyla],
Length = 238, Score = 218 bits (555), Expect = 7e-56, Identities = 101/227
(44%), Positives = 149/227 (65%), Gaps = 3/227 (1%)

>gb|AAL33916.1|AF435431_1 green fluorescent protein [Aequorea macrodactyla],
Length = 238, Score = 218 bits (554), Expect = 9e-56,
Identities = 102/227 (44%), Positives = 150/227 (65%), Gaps = 3/227 (1%)

>gb|AAL33917.1|AF435432_1 orange fluorescent protein [Aequorea
macrodactyla], Length = 238, Score = 218 bits (554), Expect = 9e-56,
Identities = 101/227 (44%), Positives = 149/227 (65%), Gaps = 3/227 (1%)

>AA2002: ABB06185 Abb06185 Green fluorescent protein GFPxm18 SEQ ID, NO: 13.
5/2002, Length = 271, Score = 217 bits (552), Expect = 1e-55, Identities =
101/228 (44%), Positives = 151/228 (65%), Gaps = 3/228 (1%)

>AA2002: ABB06184 Abb06184 Green fluorescent protein GFPxm16 SEQ ID, NO: 11.
5/2002, Length = 271, Score = 216 bits (551), Expect = 2e-55, Identities =
101/228 (44%), Positives = 150/228 (65%), Gaps = 3/228 (1%)

>AA2002: ABB06181 Abb06181 Green fluorescent protein GFPxm SEQ ID, NO: 5.
5/2002, Length = 271, Score = 216 bits (551), Expect = 2e-55, Identities =
101/228 (44%), Positives = 150/228 (65%), Gaps = 3/228 (1%)

>gb|AAL33912.1|AF435427_1 green fluorescent protein [Aequorea macrodactyla],
Length = 238, Score = 216 bits (551), Expect = 2e-55, Identities = 101/227
(44%), Positives = 150/227 (65%), Gaps = 3/227 (1%)

>gb|AAK02064.1| mutant green fluorescent protein [synthetic construct],
Length = 238, Score = 216 bits (551), Expect = 2e-55, Identities = 101/227
(44%), Positives = 150/227 (65%), Gaps = 3/227 (1%)

Example 7



BLAST 



Result of a BLAST analysis of CGFP at the nucleic acid level.

>gb|AF468563.1| Crassostrea gigas clone c077 micro-satellite sequence, Length
= 415, Score = 41.1 bits (21), Expect = 1.4, Identities = 25/27 (92%)

>gb|AC079685.2| Oryza sativa chromosome 10 clone OSJNBb0012A20, complete
sequence, Length = 131599, Score = 41.1 bits (21), Expect = 1.4, Identities
= 27/30 (90%)

>gb|AF427906.1|AF427906 Solenopsis globularia littoralis putative odorant
binding protein, precursor (Gp-9) gene, complete cds, Length = 1767, Score =
41.1 bits (21), Expect = 1.4, Identities = 23/24 (95%)

>gb|AF297617.1|AF297617 Echinococcus granulosus genotype 1 mitochondrion,
complete genome, Length = 13588, Score = 41.1 bits (21), Expect = 1.4,
Identities = 23/24 (95%)
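
The hit summaries in Examples 6 and 7 share the same fields, so the "Identities" entry can be extracted and converted into a percentage programmatically. The following Python sketch is an illustration under stated assumptions: the regular expression and the function name `identity_fraction` are hypothetical, and the code only handles summary lines of the form shown in these examples, not BLAST output in general.

```python
import re

# Matches the "Identities = matches/length" field of a hit summary line.
IDENTITIES = re.compile(r"Identities\s*=\s*(\d+)/(\d+)")


def identity_fraction(hit_line):
    """Return (matches, alignment_length, percent) parsed from one hit line."""
    found = IDENTITIES.search(hit_line)
    if found is None:
        raise ValueError("no Identities field found")
    matches, length = int(found.group(1)), int(found.group(2))
    return matches, length, 100.0 * matches / length


if __name__ == "__main__":
    line = (
        "Length = 271, Score = 219 bits (558), Expect = 3e-56, "
        "Identities = 102/228 (44%)"
    )
    matches, length, percent = identity_fraction(line)
    print(f"{matches}/{length} = {percent:.1f}%")
```

Recomputing the percentage from the fraction gives more digits than the whole-number percentage printed in the summaries, which can help when comparing hits that all report the same rounded value.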



Example 8 

Figure 6 depicts the alignment of CGFP, GFP (Aequorea) and GFP (Renilla) at the
nucleic acid level.

Example 9 

Figure 7 depicts the alignment of CGFP, GFP (Aequorea) and GFP (Renilla) at the
amino acid level.